Search for: All records

Creators/Authors contains: "Amir, Guy"


  1. In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach to solving real-world tasks. However, despite their successes, DRL-based policies suffer from poor reliability, which limits their deployment in safety-critical domains. Various methods have been put forth to address this issue by providing formal safety guarantees. Two main approaches are shielding and verification. Shielding ensures the safe behavior of the policy by employing an external online component (i.e., a “shield”) that overrides potentially dangerous actions, but at a significant computational cost: the shield must be invoked at runtime to validate every decision. Verification, on the other hand, is an offline process that can identify unsafe policies prior to their deployment, yet it provides no alternative actions when a policy is deemed unsafe. In this work, we present verification-guided shielding — a novel approach that bridges the DRL reliability gap by integrating these two methods. Our approach combines formal and probabilistic verification tools to partition the input domain into safe and unsafe regions. In addition, we employ clustering and symbolic-representation procedures that compress the unsafe regions into a compact form. This, in turn, allows the shield to be activated temporarily and efficiently, solely in (potentially) unsafe regions. Our approach significantly reduces runtime overhead while still preserving formal safety guarantees. We extensively evaluate our approach on two benchmarks from the robotic navigation domain, and provide an in-depth analysis of its scalability and completeness.
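    The selective-activation scheme described above is easy to picture in code. Below is a minimal, illustrative sketch (not the paper's implementation): it assumes the unsafe regions were precomputed offline as axis-aligned boxes, and that `policy` and `shield` are caller-supplied functions; all names here are hypothetical.

```python
# Sketch of selective shield activation, assuming unsafe regions were
# precomputed offline (e.g., via formal/probabilistic verification and
# clustering). Names and the box representation are illustrative only.
import numpy as np

class SelectiveShield:
    def __init__(self, unsafe_boxes, policy, shield):
        self.unsafe_boxes = unsafe_boxes  # list of (low, high) numpy arrays
        self.policy = policy              # fast DRL policy: state -> action
        self.shield = shield              # expensive fallback: state, action -> action

    def in_unsafe_region(self, state):
        # Cheap membership test that runs on every step.
        return any(np.all(lo <= state) and np.all(state <= hi)
                   for lo, hi in self.unsafe_boxes)

    def act(self, state):
        action = self.policy(state)
        # Invoke the shield only inside (potentially) unsafe regions,
        # instead of validating every single decision at runtime.
        if self.in_unsafe_region(state):
            action = self.shield(state, action)
        return action

# Toy usage: the state lies inside the unsafe box, so the shield fires.
agent = SelectiveShield(
    unsafe_boxes=[(np.array([0.0, 0.0]), np.array([0.2, 0.2]))],
    policy=lambda s: "forward",
    shield=lambda s, a: "brake",
)
print(agent.act(np.array([0.1, 0.1])))  # -> "brake"
```

    The point of the design is that the cheap box-membership test runs on every decision, while the expensive shield runs only on the (hopefully small) unsafe fraction of the state space.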
  2. Narodytska, Nina; Ruemmer, Philipp (Eds.)
    Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the “black box” nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions over the system whose properties indirectly imply that an agent behaves as desired. However, NLB-based certificates are typically difficult to learn and even more difficult to verify, especially for complex systems. In this work, we present a novel method for training and verifying NLB-based certificates for discrete-time systems. Specifically, we introduce a technique for certificate composition, which simplifies the verification of highly complex systems by strategically designing a sequence of certificates. When jointly verified with neural network verification engines, these certificates provide a formal guarantee that a DRL agent both achieves its goals and avoids unsafe behavior. Furthermore, we introduce a technique for certificate filtering, which significantly simplifies the process of producing formally verified certificates. We demonstrate the merits of our approach with a case study on providing safety and liveness guarantees for a DRL-controlled spacecraft.
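    As background, one common discrete-time formulation of such a certificate (the paper's exact conditions may differ) is a function V over the states of a system x_{t+1} = f(x_t), with initial set X_0, unsafe set X_u, and goal set X_g, satisfying:

    \[
    V(x) \le 0 \;\;\forall x \in X_0, \qquad
    V(x) > 0 \;\;\forall x \in X_u, \qquad
    V(f(x)) \le V(x) - \varepsilon \;\;\forall x \in \{x : V(x) \le 0\} \setminus X_g.
    \]

    Intuitively, trajectories start in the sublevel set {V ≤ 0} and can never enter X_u (safety), while the strict decrease forces them to reach X_g in finitely many steps (liveness). Checking these conditions for a neural V is exactly the kind of query that neural network verification engines can discharge.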
  3. In recent years, deep reinforcement learning (DRL) approaches have generated highly successful controllers for a myriad of complex domains. However, the opaque nature of these models limits their applicability in aerospace systems and other safety-critical domains, in which a single mistake can have dire consequences. In this paper, we present novel advancements in both the training and verification of DRL controllers, which can help ensure their safe behavior. We showcase a design-for-verification approach utilizing k-induction and demonstrate its use in verifying liveness properties. In addition, we give a brief overview of neural Lyapunov barrier certificates and summarize their capabilities on a case study. Finally, we describe several other novel reachability-based approaches which, despite failing to provide the guarantees of interest, could be effective for verifying other DRL systems and could be of further interest to the community.
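    For readers unfamiliar with it, k-induction generalizes plain induction over a transition system with initial predicate I, transition relation T, and candidate invariant P; the generic schema (independent of this paper's specific encoding) consists of two checks:

    \[
    \underbrace{I(s_0) \land \bigwedge_{i=0}^{k-2} T(s_i, s_{i+1}) \;\Rightarrow\; \bigwedge_{i=0}^{k-1} P(s_i)}_{\text{base case}}
    \qquad\quad
    \underbrace{\bigwedge_{i=0}^{k-1} \bigl(P(s_i) \land T(s_i, s_{i+1})\bigr) \;\Rightarrow\; P(s_k)}_{\text{inductive step}}
    \]

    Larger k strengthens the induction hypothesis, which lets k-induction establish properties that plain (k = 1) induction cannot; training a controller so that these checks are likely to succeed is the essence of a design-for-verification approach.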
  4. Gurfinkel, Arie; Ganesh, Vijay (Eds.)
    This paper serves as a comprehensive system description of version 2.0 of the Marabou framework for formal analysis of neural networks. We discuss the tool’s architectural design and highlight the major features and components introduced since its initial release.
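    Marabou can be driven from the command line or through its Python front end, maraboupy. The sketch below shows what a simple bounded-input query might look like; the function names and return values are recalled from the project's documentation and may differ across versions, so treat this as an assumption-laden illustration rather than the tool's definitive API.

```python
# Hedged sketch of a simple Marabou query via maraboupy.
# API details (read_onnx, setLowerBound/setUpperBound, solve) are from
# memory of the project's docs and may vary between versions.
from maraboupy import Marabou

network = Marabou.read_onnx("model.onnx")        # hypothetical model file
input_vars = network.inputVars[0].flatten()
output_vars = network.outputVars[0].flatten()

# Restrict every input to a small box.
for v in input_vars:
    network.setLowerBound(v, 0.4)
    network.setUpperBound(v, 0.6)

# Ask whether output 0 can be driven to 0 or below anywhere in the box;
# an "unsat" answer means the property "output 0 stays positive" holds.
network.setUpperBound(output_vars[0], 0.0)
result = network.solve()
print(result)
```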
  5. Groote, Jan Friso; Larsen, Kim Guldstrand (Eds.)
    Deep learning has emerged as an effective approach for creating modern software systems, with neural networks often surpassing hand-crafted systems. Unfortunately, neural networks are known to suffer from various safety and security issues. Formal verification is a promising avenue for tackling this difficulty, by formally certifying that networks are correct. We propose an SMT-based technique for verifying binarized neural networks — a popular kind of neural network in which some weights have been binarized to render the network more memory- and energy-efficient, and quicker to evaluate. One novelty of our technique is that it allows the verification of neural networks that include both binarized and non-binarized components. Neural network verification is computationally very difficult, so we propose various optimizations, integrated into our SMT procedure as deduction steps, as well as an approach for parallelizing verification queries. We implement our technique as an extension of the Marabou framework and use it to evaluate the approach on popular binarized neural network architectures.
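    To make the encoding concrete, here is a minimal sketch (not the paper's Marabou extension) of how a single binarized layer with sign activations can be expressed as solver constraints, using the generic z3 SMT solver; the weights and the property are toy assumptions.

```python
# Toy SMT encoding of one binarized layer y = sign(W x), using z3.
# This illustrates the general idea only; the paper's technique is an
# optimized Marabou extension with dedicated deduction steps, not this
# direct encoding.
from z3 import Solver, Real, If, Sum, sat

W = [[1, -1, 1],
     [-1, -1, 1]]              # binarized weights in {-1, +1} (toy values)

s = Solver()
x = [Real(f"x{i}") for i in range(3)]
for xi in x:
    s.add(xi >= -1, xi <= 1)   # input domain

# sign activation: +1 if the pre-activation is non-negative, else -1
y = [If(Sum([W[j][i] * x[i] for i in range(3)]) >= 0, 1, -1)
     for j in range(2)]

# Property to refute: can both outputs be -1 simultaneously?
s.add(y[0] == -1, y[1] == -1)
print("counterexample exists" if s.check() == sat else "property holds")
```

    The If-based case split is what makes such networks hard for generic solvers; handling binarized components natively inside a verifier like Marabou, as the paper does, avoids much of that blow-up.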